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1, Introduction 

The an Coma t Ion of first order logic lias received comparatively 
Little attention from researchers intent upon synt tie siring the theorem 
proving mechanism used by humans* The dominant point of view [tF] ± [IB.] 
has been that theorem proving on the computer should be oriented to the 
capabilities of the computer rather that to the burton nind and t tier afore 
one should not be afraid to provide the- computer with a logic that humans 
might find strange and uncomfortable* The preeminence of this point of 
vi4W is not hat'd to explain .since until now the moat successful theorem 
proving programs have been machine oirttitod. 

Nevertheless t there are at least two r&ASPn* for being disSati-S” 
fied with the machine oriented approach, First * a mathematician often is 
interested more in understanding the proof of a proposition than in being 
told that the proposition is true, fox the insight gained fros an under¬ 
standing of the proof tan lead to the proof of additional propositions and 
the development of new mathematical concepts, However* machine oriented 
proofs can appear very unnatural Co a human matherne t itisn thereby provid¬ 
ing him with little if any insight* Second, the machine oriented approach 

*ThiS work was conducted ac The George Washington University Pro¬ 
gram in Logistics and supported under Contract NDDD14-&7-A-D214* Task" 0001 h 
Project MR 347 020* U * 5 * Office of Naval Research. The author also is grate 
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the use of their computational facilities* 



lias failed to produce a computer program which even conies close to equaling 
a good human mathematician in theorem proving ability; thia leads one to 
suspect that perhaps the logic being Supplied to tha machine is net as effi¬ 
cient as the logic used by humans. 

The approach taken in this paper has been to develop a theorem 
proving program, as a vehicle for gaining a better understanding of how hu¬ 
mans actually prove theorems. The computer program which has emerged from 
this study is based upon a logic which appears more "natural'^ to a human 
(i.e., more human oriented). While the program is not yet the equal o£ a 
top flight human mathematician, it already has given indication (evidence 
of which IS presented in section 9) that it can outperform the best machine 
oriented theorem pmvera- 

Some work was begun in [2], 1 3 .I, [7], Lib] and Q1 7l directed 
toward the introduction of a reasoning by cases mechanism into automatic 
theorem proving. One can give many examples where humans use such a mech¬ 
anism. Thus, in proving set A Is Identical to sot E, usually cne will 
attempt to prove two cases; (!) A la a subset of B and (2) E is a Subset 
cf A. In proving that a system is a group, usually one will attempt to 
prove two cases; {1} if two elements ars in the group, then the product 
is in the group and (2) if an element is in the group, then its inverse 
la in the group. In proving a thecreffi by induction, one will prove a 
basis case (i,e,, that it i* true for say n=l) and also an induction case 
(i.e. if it £3 true for n, thee it must be true for n+ 1 ). 


However, some seriouB obstacles have prevented the effective 
use of reasoning by caseE by previous automatic theorem provers, These 



obstacles- relate to the overall {i.e*, global) organisation of these pro¬ 
grams. Amy problen solving program meat have an executive Tontine which 
controls the allocation of its computational resources. The executive 
routine must have A procedure for (1) determining when and how to create 
a goal and (2) preventing an explosion of goals from taking place* Pre- 

vious theorem proving executives [3], Lit)], [17] made the mistake of asso- 

1 

ciating A goal with every new formula that got generated. By contrast , 
the present program first rakes an intensive attempt to solve a goal {which 
typically will generate a nunber of new formulas for further processing 
under the control of a lacal executive) before deciding to generate a new 
goal* As viewed by the c loha l executive! new goals get created only when 
a goal gets Split into subgoals [where the logical basis for the split is 
a reasoning by cases argument)* The rationale behind this is that the 
x^sourctS lor solving a goal without splitting are considerable (i*e* t 
repeated use of modus ponens, the equality inference rule, rules for 
simplifying formulas, etc*) and therefore one should provide substantial 
opportunity for solving the goal before introducing additional goals. 


"'"This mistake w*£ net made in [ 2 ] and [7] where procedures were 
presented for breaking a theorem into component cases which then are 
attacked by Standard theorem proving techniques* However, thoE* procedures 
do not provide the means by which the application of the standard techniques 
Can interact with the mechanism that splits the theorem into cse-es, Thus,, 
if Che Standard techniques are inadequate for solving e case, the informa¬ 
tion generated during the unsuccessful attempt is lost and will not contri¬ 
bute to the process of creating new subcases. 



Nevertheless* the splitting of goals into Eubgoals i& often naees“ 
aary and if not controlled it could overwhelm the executive with more goals 
than it could handle* The hey ee controlling this growth (ami it he a 
demonstrated Its value in the present program! stems from the recognition 
of the fact that it is not always necessary to solve each subgoal created 
by a Eplit in order to solve an ancestor goal. For example, suppose the 
pfCgrss. knows that It can prove a goal G by proving A and S. So it first 
attempts to prove A and generates a number of now formulas in this attempt * 
However* suppose that, although it atill cannot prove A, it nevertheless 
knows that it alao could prove G by proving C and D. The program then would 
attempt to prove C and D and would view these attempts as subgoals of A. 

The reason C and D arc regarded as subgoals of A Is that their proofs could 
depend upon Information derived from A. In this case* the solution of G 
and IJ would mean that we still would, have to prove E in order to conclude 
G* However, if the proofs of C and D did not utilise information derived 
from A, then we would not Have to prove E and could bring the proof of the 
ancestor goal G to an immediate conclusion. Similarly, If the proof of C 
did not depend upon information derived from C, we could skip over the 
proof of D. 

Eros the viewpoint of human problem, solving,* we can regard these 
splits as "strategies"' by which some an tea ter goal ia to be solved (it need 
not be the immediate ancestor) . The program (and alao a human) may have a 
number of different strategies under consideration simultaneously ■ In tha 
course of implementing these strategies s the program discovers which strate¬ 
gics are of assistance to other strategies, which strategies cust be completed* 
ar.d which strategies -can ba discarded. 



2, PrelimtnaryConcep ts 


The reader ie referred Co [II] for n rigorous treatment of the 
basic definitions and ground rules used by tost programs which prove 
theorems in the first order predicate calculus. An informal description 
is presented in this section in order to Sure that the reader is 

acquainted with the basic ideas. 

We use the four logical connectives "" f P {"implies"), 

^ ("and 1 ' 1 ), v r ( M or' r ) as well as the universal quantifier y ("for all") 

and the existential quantifier 3 ("there exists"). We assume an infinite 
Supply of variables, constants, functions and predicates. 

Terms : A variable is a terra ^ a Constant is a tern, and a function 
f (tj , ., .,t Q ) is a tent provided that its arguaenta t^...^t are all terms. 

Atonic formulas : A predicate P{t ^ r * * * , C ) is au atonic formula provided 
that its arguments t^,,,.,^ at* all terms. 

literals i An aconite fcmule A is a literal and the natation pf an atomic 
formula '■Aha literal. 

Formulas i An atomic formula is a formula, If A and 3- arc formulas > then 
~ Ai and A^U are formulas, IfA(jt) is a formula that may 

depend upon 90En£ variable x, then YxA{:x) end IkA(k) ire formulas. 

Given the desire to prove that Che formula A. logically follows 

from the formulas .we will assume that the formulas „.,, *A * 

— A. ^ are all true end then attempt to derive a contradiction. The first 




to prenex form 


step ie to convert each of tb£ formulas 
using a standard procedure [9"J t A formula which is in prentet form has all 
ICS quantifiers (If *ny) At the front of the formula. The ncett seep la to 
remove first all the existential quantifiers and then all the univ&rsal 
quantifiers. An existential quantifier is removed by replacing the CKiswn- 
tial variable it quantifies by a function of those universal variables whose 
quantifiers appear to the left of the existential quantifier in question 
[ll] . P°r example* in the fotmula ^uVvSw^hP (u ,k , w p v) „ the existential quan- 
tifJer which quantifies the variable w would be reaoved and w would be 
replaced by a function of u and v, say £(u,v}, since these variables are 
universally quantified to the left of v. The resulting formula therefore 
would be VuVvVx F (u p x , f {v, v} p v) . Ifniversal quantifiers then are dropped with 
the understanding that the variables in question arc to be given universal 
interpretations (i.e. if A{x} is assumed to be true, then it is nuuumsd to 
be true for all possible values of the variable 5t)* 

At the heart of all current theorem proving programs is the use 
of ‘Watchinjj 11 routines. For example* suppose the program established that 
the foraulaE A(x,a,s) and '"■A(bjyjb} both follow from the as s map Elans of 
tha problem where x and y are variables and a and b arc constants. A catch¬ 
ing routine then would determine for the program that variable x should be 
set equal to b end variable y should be aet equal to a in order to elimin¬ 
ate the sources of difference between the formula Afx F a,xJ and the formyls 
A(b,y,b) + Since A(x P a ? iO nuat be true for x^b fes it is true for all values 
of x) and ^(bjyjb) must bo true for y*A (afi it l .S true for all value e of yO p 
this afeans that the formula Alb^h) would contradict the formula —■ A(b p a,b) 
and the program would have obtained a proof. 



3, "file Logical Deduction Rules 

The use of matching routines is implicit in the rules of infer- 
ence to be described in this section* Thye, when express Ions .1 and E' 
appear in the statement af an inference rule, it should be understood that 
E and E 1 represent expressions which have been mace Identical by means of 
a matching routine. For example „ the application of A(b,y>b} t A(x,a t x)^B(i) 
to rule E6 would mean that A(bjy,b) would be made identical to A(x,a^x) by 
setting x=b* y™a and this would reEult in the output B(b} where A(x»a,x)* 
Afb^yjb] y E{ji> ibd E(b) play Che role of A, A', B, and, B' respectively. 

As described in section 2, we begin with a set of formulas all 
pf which ere assumed to be true, hew formulas arc established with the 
help of rules E2 through Rll. Rule R1 determines when a problem has been 
solved. 

Rl* A problem is solved when it has been established that both the 
literals A" and -“A are true, 

R2* Replace formula —-A by A. 

R.3* Replace formula AAl by A,B. 

RA, Replace formula — (A>wE) by ~A, — B fi.e. if one wishes to 
prove A\^E, then either prove A or prove B). 

R5 * Replace formula (A by A, “~B (i.e. if one wishes to prove 
A 2>E„ then assume A and prove B) . 

Kb* (Mody* ppnens) If it has been established that A 1 and A^E are 

true where A is a literal, then B 1 tp the set of true formulas. 

RT. (Modus ponens) If it has been established that—E' and ASE are 

true Where E Is an atocit formula t then add ~A r to the set of true 


formulas. 



RSi (Modus ponens) If it has beet* established that B 1 and AD^B are 
true where iJ la an atomic formula* than add '•-A 1 to the set of 
true formulas* 

R9. (Reasonihg by cases) Split AV'B into case A and case E as shown 
in section 4* 

R1Q, (Reasoning by cases) Split -- (A^E) into case ~ A and Case 
as shown in section 4. 

PJI, (Equality relation) S£ it has been established that t=C And A{r’) 
are true where A{f*) is a litoral that depends upon the tern 1 , r 1 * 
then add A(t’) to the set of true formulas, See section 7 for a 
more complete discussion of this rule* Also* aee section 3 for 
the treatment of functions that Are either bath associative and 
commutative or juht associative as the program has Special routines 
which incorporate Such functions into the equality relation, 

K12, If It has been established that P(t^**., t t^) and ~PCt£ .L^) 

are true literals where P is not the equality predicate and if 
t_j. has been made identical CO t! by means of a match for all ir^j 
but this match fails for i"j , then add ''“(t^t!) to the set of 
true formulas. 

Of the machine oriented Approaches to automatic theorem proving* 
by far thE most important have been those based upon the resolution prin¬ 
ciple [j_ 5 j . Although resolution is based upon a single Inference rule 
for generating new formulas, it is convenient to regard It as two rules 
— called unit and nen-unit resolution respectively. Unit resolution 
says that if it has been established that and ~ A^\/V„. *. .A^ art 




true where Aj , , - ., , are literals, then we can add v'■ ■ --A^ te 

2 

the set ef true formulas. From the standpoint of rule R9 > the formula 
—A^ V A^s/ *. ■ i can he thought of as representing r cases given by the 

r literals ~ A^, Aj, < < . A , The application of unit resolution to the for¬ 

mula “A^ v/ A^y » d , + +A^ then has the worthwhile property that it produces 
as output * foraula with one less case £i + e. Al V has only 

r-1 cases) , CtL the other hand, non-unit resolution takas as input the 

two formula! A. V A„\/ ■ , ■ + + A and Aj V .B and generates as out- 

1 & t L I t 

put A^'V . A^ v V»* T *»E ' ^ which is a formula which rule R9 would 

regard as representing r+t-2 cases. Since r> 2 and t. > 2 * it follows that 
r+t-2. ^man{r 1 1). This mcariB that the output formula of a non-unit resolu¬ 
tion will have at least AS canny and often more cases than cither of its 
input formulas.. Since the ultimata objective is to eliminate all cases 
from some formula, it is understandable that non-unit resolution WOuld be 
much less effective than unit resolution and indeed researchers Soon gave 
preference to unit resolution when generating now formulas [.\D], [ 41 , how, 
since ™A^B is logically equivalent to ADE, unit resolution at least is 
related te modus pouenS (See rules Rb through RS) which is a comtcn form 
of human reasoning whereas non-unit resolution appears very unnatural tc 
a human. The Suspicion is quite strong that h'jmans do not use non-unit 


As in the other deduction rules described in this section, we 
are adopting the convention which considers expressions written as & end 
E 1 as having been Ziade identical by =eans of a matching routine. 




resolution when waking deductions but do use a form of unit resolution 
precisely because non-unit resolution is so much less efficient than unit 
resolution* 

There is no mechanism in resolution for breaking a difficult 
problem into two or more simpler sybproblams* Yet this is a common fea¬ 
ture of human problem solving* The value of such a mechanise is that it 
is generally easier to solve a ntrmbcT of simp 1 b problems than it is to 
solve a single hard problem. Tho present program uses reasoning by cases 
as the logical basis £or generating a goal-subgoal hierarchy aad this is 
described, in section 4. However,, the follcywing simple example can help 
to illustrate the main ideas. 

F.vaTiplg; Wq wish t* obtain a contradiction from the following six axioms 
(Al through A6) where x* y* end z represent variables and a, b t and c 
represent constant*. 

Al* Kt*ty*s} ■ (x#y)«z 
Al * P (x*y ) v - P (x:) V (y ) 

A3, P[a) 

A4 + P (b) 

A5, P <fi) 

Afs. "* P (a * [b*c)) 

The program first attempts to find a contradiction by repeated 
use of all the deduction rules except R9 and RID, At this stage, the only 
new formula it can generate is A7 which is obtained by substituting Al into 
AG using rule Ell." 

3 

Strictly speaking t we are subsclcueing Che right side of Al into 
Ab after the match between 5 f*(y*s) and a* (bat) produced x*a a y-=b t and z“c* 



A?. ~P((a*t>)*t) 

Actually* the program would not have considered A7 as distinct from AG 
If it had been told that the symbol * satisfies the associative law A1 
(see section, £) + However* for purposes of exposition p we have aggvnaed 
in this example chat the program has no explicit knowledge that * is asso¬ 
ciative* 

The remainder of this proof can be understood Hast by referring 
to Figure 1 which describes the gee1-suhgoal hierarchy that was generated 
from this example t 


Figure 1 



The node Libeled T 1ft Figure 1 represents the top level gh&l which consists 
of the desire to obtain a contradiction from the axioms A1 through A& + 

Since after generating the formula A 7 the program finds that it has exhaus¬ 
ted its resources without obtaining a contradiction* it now torns to a 
reasoning by cases argument (i.e* rules R? or RJOJ * Axicois A3 is chosen to 
be split into Milt which become SubA&als b£ T as ehpwn id Figure l h This 
generates AS. 

AS. Ftxivy), Case 1 Of A2. 

Once again, the program attempt* to find a contradiction using all 
the resources at its disposal except rules E9 and KlO« This attempt dis¬ 
covers I 1 *’ 0 distinct contradictions. The first contradiction is between A6 
and AS far x*a, y*b*£ and generates the formula A9. 

A9. —P(a) sy ~P{b*c). Cases 2 and 3 of A2 for s-a, y-“-b*t. 

The second contradiction is between A7 and A& for x™Ei*b* y»C and generates 
the formula Alt). 

AlO, F(a*b) V "^(c), Cases 2 and 3 of A2 for x=a*b, y=c F 

After no more new formulas Can be generated not new contradictions 
obtained* the program discharges all fomulas that resulted from the attempt 
to solve ease 1 of A2 but formulas A9 and AlO now replace cases 2 and 3 of 
A2* This means that Al through A? tog other with A9 and AlO arts the only 
formulas under consideration by the program once case 1 of A2 has been dis¬ 
charged. The program, now chooses to split A9 into casea as shown in Figure 
I, This generates All* 

All, Case 1 of A3. 

A Contradiction is obtained between A3 and All. IhiE results In All being 



discharged -and Al2 generated* 

All* ~PCb+e)* Case 2 of A9, 

Again, an nttenpt is made to salve this last case (i + e H "prove 
without further use of a masoning by cases argument (l.e r rules R9 Or RlQ) . 
This attempt fnils. In fatt, it even fails to generate any new formulas. 
ILewever, Tat her than abandon, its attempt to solve this ease, the program 
splits AID and considers the formulas A13 and AI9 that result from the split 
as Subgoale of A12 as shown in Figure 1* Although the solution cf AlJ end 
Al$ would solve T, they are treated as suhgoals of Al2 since the program has 
no a priori way of knowing whether the information derived from Al2 will be 
needed in the solutions to A13 and Al^S The program now attempts to Solve 
A13. 

All. ~P(s*b). Case 1 of A 1.0. 

The attempt to solve AL3 also fails so an attempt is made to split A2. 
Although the solution of the goals treated by the Split of A 2 would solve 

T* these goals are attached aa subgoals of A13 as shown in Figure 1 since 

information provided by their ancestor goals might help in their solution. 
Indeed, it should be recalled that kl wo* split onto before and resulted in 
the formulas A? and AID, The reason another attempt 1 e made to Split A2 is 
that new formulas (i.e, A1Z and A13) now are available which were not avail¬ 
able during the previous attempt. 

The program now attempts to solve A14+ 

A1G, P<X*y). Case 1 of A2. 

This second attempt to split A2 results in two new contradictions. The 

first contradiction is between AI2 and A14 for i™b, y*c and generates the 


formula A15. 



Al5» ~F(b) v-P(c) + Cases 2 and 3 of A2 for x=b* y=c. 

The second contradiction is between A13 and A14 for x-a> y=b and generates 
the formula Aid. 

AJ&i "'Pi'tO . Cases 2 and 3 of A2 for i!=fii y=b. 

After no more new foraulas tan be generated nor new contradictions 
obtained* th* program discharges all formulas that resulted fron the attempt 
to solve case 1 of A2 but formulas A15 and AlS bow replace cases 2 and 3 of 
A2. The program now choose* to split AlS into subgoalfc of A13 *3 shown in 
Figure 1, 

A17* ~F(b). Case 1 of AlS. 

A contradiction is obtained between kh and A17- This results in A17 being 
discharged and AlB generated. 

Aid. ^{c), Case 2 of AlS. 

A contradiction is obtained between AS and A18. Now since both the genera¬ 
tion of A17 and AlS as well as the solution nf these goals in no way depen" 
ded upon the parent goal A13* the program can conclude the proof of A12 

ii 

immediately without attempting the proof of A19, By contrast* the solution 
of A12 did Involve information derived from, A12 since its solution depended 
upon AlS which was derived from A12, However, since A12 is the last goal in 
the split of AS* this allows the program to conclude that it has a Solution 
to the top level goal I. 

^Although Ln thiE simple example the proof of A19 would have been 
trivial, cue cannot tx.pe.CC tn be so fortunate in general. 



The above example by its very siapliticy failed to eE.plis.5ize the 
role played by the equality inference rule and modus poneris in the 
present theorem, proving program. a very important feature o£ the program is 
it a mechanism for cofit roll lag the number uf fGriauLas generated by the 
equality inference rule and this is described in section ?■ Also, a fortsula 
coded as A^iE is used with modus ponenS as a means for solving a case 
whereas the same formula coded as "■'AVE would he used in generating the 
separate cases ~A and L One encounters a number of formulas such, as 
k=>2k*z = y*z wh&9& usefulness lies in the help they provide Ld solving 
a. case rather than as a means for generating separate cases H The program 
has facilities for exploiting such formulas and these facilities are 
described in section 5. 


Reasoninfi by Cases 


The initial data given to the program iE the list (L^L^} where 
is a list of formulas from which a contradiction Is to be found and is an 
empty list. A formula F is removed from and prepared as a possible input 
to one or more of the deduction rules , F is not allowed to be of the type 
AvE or -- since we wish to make a concerted attempt to solve the problem 

before splitting it into s lih problems. If F can be applied Successfully to a rule 
which requires only one input (i.e. rule R2, Rl* R4 and S5), then F is discarded 
and the output from this application of the deduction rule is pieced on L ^. 

For example* if F is the formula ^CADB) , then F would be discarded and instead 
the formulas A and would he inserted on L^. 

If F cannot be applied successfully to a one input deduction rule, 
then an attempt is made to apply it to a two input rule by using a formula 


from Lq SS the Second input ► A successful application to such a rule wCuld 



dean that the output would be placed otl I,j unless the rule is ill In which 

cast a contradiction would be found, After every formula on Lq has been. 

paired with F as the second input to each deduction rule* the formula F 
, „ 5 ^ 

gets placed on Lg ■ ft new formula F now la reaoved from L.. and the pro¬ 
cedure repeats itself until cither e contradiction has been obtained, the 

li-St has been exhausted except for formulas of the type A\/B or MAA6), 

6 

or a time limit has been exceeded... 

If a contradiction cannot be found in the above manner, then an 
appeal must be made to rules R9 Or RlO. Rule R9 splits $ -dL@j unction of 
the form E^x/'E^'v'. .v'tJ_ whereas rule RIO splits a negative conjunction 
of the form * If at this stage there are no disjunc¬ 

tions or negative conjunctions qb then the program must admit failure. 
Otherwise, a disjunction or negative conjunction is taken from L ] and a 
list is constructed as follows, is a List of four elements, . 

(i.e. the first element on list K^) initially represents the list 
(E 1 ,B^,E ;3l + + t , ,& f ), K t 2 is either the symbol V or the symboldepending 
upon whether an application of rule R9 or RIO is being mad*. ^ „ initially 
is an empty 1LsC , ^ initially is a list of the variables appearing in 

any of the formulas 5 ^3 T * + + * ■ Although the following discussion 

will assume that rule R0 is being applied, the treatment of rule RID is 
very similar by virtue of the logical equivalence of ^(A^E) with —AW t 


"As will be shown in section 7, a successful application of rule 
R1I to F often will cause F to be discarded immediately in favor of the 
resulting output which is placed an L^. 

^The action taken by the program if the time limit is exceeded 
f@ described at the end of the present Section. 
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Tha Et&te of affairs is summarized now by the list (L^, , L.,) 

where it a list consisting of the first formula on list (which 

in this case is E^) together with all the disjunctions and negative 
conjunctions of L-|* As before* a proof is attempted first without using 
roles R9 ot HLO* Only this time, the output of a deduction rule is 
placed on instead of on L, , a formula F which is removed from and 

applied to the deduction rules is either eventually discarded of trans¬ 


ferred to Instead of to , and the deduction rules which require 
two inputs will pair the fomula F with formulas from either L| or Lq 
instead of just from L^, 

Suppose a contradiction is obtained* How should we proceed? An 


obvious method would be to [1) remove from the list ^ of Cases 
which have not yet been solved and place it on the list K_. .* of cases 

If J 

which have been solved already, (2) reestablish Lj as it had been just 
prior to the creation of s eraae and replace it with a new 
list consisting of the next case B., (which is now the first formula 
on list together with all tht disjunctions and negative conjunc¬ 

tions Df L^„ and (4) search for a contradiction in a similar manner as 
in case 1 when B 1 had been the tonerolling hypothesis. 

The difficulty with thfc above four step method is that, in the 


solution to e£3£ 1, a variable x appearing in E^ may have been set equal 
tp some term t; if this Variabl* x also appears in one of the formulas 


B^, B^,,* + * + , then this specification that 3t should equal t must 

prevail also in at Least one of ch«s« Subsequent cases. In order to 


insure that s in fact does equal t in this latter case, the variables of 



13, s E„., , are classified into types according to whether the 

variable is to he given a "universal" or '^letentlal" Interpretation. 

Normally* all the variables would have universal lnterpretations 

if formula A(x) Is- true s then it is true for all values of the 

variable x} „ since all the existential variables already were eliminated 

by the procedure described In Station 2> However, la at Utkins Case l t 

we now place an existential interpretation on those variables which appear 

in both E^ and one or more of the formulas B^ {i.e., if 

A{x)vBW is asstmed to bo true* then we rust find Sufffe value of x which 

will permit a solution to each case). For thiE purpose, we will say 

that an existential variable has been specified if it is set equal to a 

7 

tern, t where t is not a universal variable. 

bow in first attacking Case 1* we seek applications of the deduction 
ruleB (except rules R9 and RIO) and subject these Tules to the restric¬ 
tions that no existential variable nay be specified unless the. specifica- 
tion occurs during a successful application of rule Rl (1+e+j, we specify 
an existential variable only when St doing will assure thfc proof of s 
subproblem thereby providing a valuable restriction trn the generation 
of sobproblems), 

If a successful solution is found which does not specify any 
existential variables, then we proceed directly to Case 2 by the four 

7 

Jlote that if an existential variable has been $at equal to a uhiveT’” 
sal variable, then the existential variable Is Still fret to assume any 
value which can be assumed by the universal variable- However, since a 
universal variable by definition can assume any value* this means that 
this existential variable has not really been "'tied down 1 " or "specified" 
upOh being set equal to the universal variable. 





step Mtbod described above (i.e,> ue assume E2 and discard all formulas 
that were generated during e*se 1 when EL was the hypothesis of the case) 
The reason we go directly to case 2 when no existential variable was 
Specified is that the solution of case 1 would not have committed any 
variable appearing in any of the subsequent cases ► 

However* Suppose the successful solution necessitated the Specifics^ 
tion of at least one existential variable, Then we would not proceed 
directly to case 2, Instead, the use of existential variables enables 
the program to find a nusaber of different SO Lot ions to case 1 during the 
same attempt at solving th« case; this avoids the duplication of effort 
that would appear if the program were to proceed directly to case 2 as 
Soon as it found a single solution only to find latsr that it must 
generate still Bore solutions to case 1 in order to solve the original 
problem► In particular, let x represent the vector of existential 
variables■ Bach time a solution to case 1 is obtained which resulted in 
a different specification t of the vector y,, we generate the disjunction 
of the remaining cases for this specification {ihC-, we generate the 
f crania By „ > > , ,vJ (t)) . After no more solutions can he found 

the program discharges El end all formulas generated from El except 
those disjunctions E 2 (t) VE^ (t) , , , x^V'B^ft) each of which represents the 
remaining cases associated with a different solution vector t, Rather 
than consider any of the regaining eas.es 2 through r, the program later 

;g 

Any existential variable which was left unspecified by this solu¬ 
tion would be replaced by a new universal variable in tlie formula 
COv'E^Ct) ,.,. .\yE r (t) . 





will apply a reasoning by eases analysis to one or more of these 
disjunctions EEowever* the original disjunction 

.still would. be retained for a possible future application 

of rule RE?. The reason for this retention is that subsequent applications 
of rules R9 and RID* by generating a goal-sungcal hierarchy such as 
was illustrated in Pigere 1> could provide additional formulas which 
might ferm the basis fot new solutions to case 1 of this disjunction. 

So far wc have assumed that we did not need an additional reasoning 
by cases analysis in order to solve case 1. However r if another applica¬ 
tion of rule R9 and RIO is needed,, the program zsust decide whether 
the application should be treated as a subgoal of ca$e I or whether 
the c&se analysis of E^vR. .... ,^R should be abandoned *1together. 

The heuristic that is used to determine whether Case 1 is worth pursuing 
is the presence or absence of existential variables in EZ, Thus> 
esse 1 would be abandoned if and only if an ejjisteftti&I variable appears 
in Bl, Tills heuristic also simplifies the programming sfpte it m&4P£ 
that we do not have to cOBp&re existential variable* that originated 
from different disjunctions. In any event, the abandonment of cast 1 
is not necessarily permanent. Case 1 is just postponed in favor of an 
attempt to solve the cases of some other disjunction or negative 
conjunction. This attempt Could provide the material with which to 

achieve Solutions to case 1 of .S/E when and if this 
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disjunction is reactivated at a later date.' 

9 

If there were no more disjunctions or negative conjunctions, then 
not only would the abandonment of Case 1 becoae permanent but the program 
would tenninace its attempt to solve the original problem as well. 






In general, the state of the system is described by a list 
(1 q, V-^ t Lj, f + .►►., L n h ^ where ^ contains the infor¬ 

mation tontrolling the reasoning by cases analysis of some Specifit 
disjunction or negative conjunction and is defined in a manner similar 
to K, * Thus „ K. t represents the list of cases which, have, not yet been 
solved, ^ t ^ 5 ' ^T 111 ^ 0 ! v Dr ^ defending upon whether an application 

of rule R.9 or ElO is being"made, K is the list of cases which have 

1*4 

been solved already, and ^ is the list of existential variables 
appearing in formulas an list K_. The first formula OP list ^ 
represents the case currently Linder attack and K. + K_. + j 

were generated in an attempt to solve this case. For 1 ^ i < ti, formulas 
appearing in aLready nave been processed by the deduction rules and 
art under the immediate control of K„. Formulas appearing in Lq were 
processed prior to the application of any reasoning by cases analysis. 
Formulas appearing in L . have not yet he bn p recessed by t h S d e 
duction rules. 

A new case gets initiated at what is then the lowest level of the 

goal-subgoal hitarchy. If this lowest levEl is n, then the empty list 

L , _i would be created. The first formula on list E , would be placed 
n + 1 n h l 

on L . , if K. . = v ! if K „ ■ , then the negation of this formula 

o+l n ,4 n t l 

would be placed ua ^ , An attempt first is made to solve this case 

using the deduction rules (except rules E9 and RlO) and the output of 
tbeee rules is placed on + If a formula T, which appears on 

L + p is applied CO a two input deduction rule, then the atcond in¬ 
put would come from one of the lists L„, 1» L - If F is neither 

y i. ti 





a disjunction nor a negative conjunction, then It eventually would be 


either discarded or transtarred from t* , . to L , 

ft + 1 n 

Since the application of J1|\/ ► + *■ ► ■ Vto rule R9 means that the 

program must find r solutions instead of just one solution, it is a 

matter of great concern whether r. i-,,VB r is really needed for the 

proof} for if it is not needed, then ita application to rule R$ could 

he e great waste of effort, Furthermore, if for each of these r cases , 

the program should choos* unnecessarily a formula C C ,,. , , . vC for 

t 

application to rule R9, then it would have to find rt solutions when 
only one was really needed. Clearly, the computational effort COuld 
snowball if the program is not careful abouE how it applies formulas to 
rules R9 and R10 + This is n&t just a theoretical possibility. In the 
course of searching for a proof, it is not unusual to generate many 
irrelevant disjunctions and negative conjunctionsr Which of the many 
disjunctions and negative conjunctions is the program to choose for 
applications to rules R9 and Rli>? 


One way of attaching this problem is to let the decision 10 Select 
a particular disjunction or negative conjunction represent the node of 
a goal tree. Although the use of goal trees is a common approach in 
artificial intelligence research, it will not work her* because a good 
method for evaluating these nodes la not readily available, 

bet us eunmarize the situation Which has just been described. 
Reasoning hy cases offers the Opportunity to decompose a problem by con-“ 
sidering each case separately and automatically erasing all formulas 



that were senerated during the attempt tc Solve a cast before proceeding 
to any of the subsequent cases*. Cn the other hand* reasoning by cases 
could lead to a disastrous aitplog4cm of subprob1ferns under consideration 
hy ths the area p rover. How then do we prevent Such an explosion, from 
taking place? 

The escape from our dilerrea turns Out Co be surprisingly siniple 

and one which is likely used hy hximun theorem provers as veil. The basi 

idea is to determine whether the so lotion to s caEd- depends upon the 

hypothesis of the ease; for if no Such dependence is found* then none 

of the subsequent eases need he considered. 

Recall that the system is described by a list (Lp* L., E^ + L^+ 

. K , L * L , n )where K, controls the ease analysis At the 

n n n mL i 

level of the coal-subgoal hierarchy. We aay that formula F depends upon 
if either F is the hypothesis of one of the cases of DL or else the 
derivation of F utilized this hypothesis at least once as an input to 
one of the deduction rules* tie then define U(F) = dependence of formula 
F = the set of thoSt K. upon which F depends. We next define D^ ■ 

* 1 3 t 

dependence of the solution to rase t of Ik = the union of all D{F) taken 
Over all formulas F shat appeared in the aolution to case t of K,. In 
particular. Suppose the Solution to case t of E^ itself utilized a 
reasoning by cases argument. Then this additional case analysis must 
have been solved under the control of some E. , Letting n 

1 “t _L i t A 

represent the number of casts that were solved in the case analysis of 
E. + ^ and letting + 1 0 represent the dependence of the particular 
disjunction or negative conjunction which generated K. + ., we would 




obtain t as the union of all ^ + ^ j taken over all luteRare j far 
vti icb 0 < j 

Upon obtaining a solution t& taS& t of K. , the program asks "Is 
Jt_, o member of D_^ t ? ri If the answer is yes, the program goes directly 
to the next case of However, if the answer is no, chon it skips 

over all the ranaining oases of K. and instead immediately concludes 
that it has solved the currently active case of ¥._^ . 

Furthermore t before a case of K actually is attempted, the program 

first examines the lists K ~ for all j < i in order to determine whetbe: 

Jr “ 

the case had he*n solved already. If the answer is yes, the program 
assigns the sane dependence to the solution of this case as prevailed 
fox its previously solved duplicate and then skips over this case by 
proceeding directly to the next case of k ; . 

The program also would not attempt to split a formula P into cases 
if one of the cases of F represented a sped alia at ion of a literal 
already appearing On seme (i*&., there is no point in examining any 
bf th& oases of F unless each of these cases is supplying us with some 
new information) . For this reason* a further restriction is olaced 
upon the specification of an existential variable. A Specification of 
an existential variable is allowed by only if it does not transform 
a subsequent Case of into a special!cation of a literal already 
appearing on an L for some i < n. 



We have said nothing as yet about the crder by which disjunctions 
and negative conjunction* get activated by rules R9 and RIO, Associated 
with every ftoal G is tha list S(C) of those disjunctions and negative 
conjunctions that were available to G at the tine of its activation. 

The attempt at solving G with cut utilising rules E9 and Hit) results in e 
new list TfG) consisting of the elements of 5(G) followed by those dis^- 
junctions and negative conjunctions that were created during this attemptr 
If it i® decided to sprout subgoals from G, then the first available 
formula F oci list T(G) becomes the instrument for the split. Suppose 
■C 1 is one of the cases obtainsd from the split of F. If no two cases 
of F have the same variable in ccaeoft, then S(G’> would be T(G) with 
formula F removed as there then would be no need to split F more than 
once. However, if the same vartabls appears in more than one e&9£ of F, 
then we may wish to seat additional splits of F at a Iatar date and so 
SfG 1 } would he T(G) with formula F transferred from the first to the 
last element in the lisf,^ 


If the attempt to solve without E9 and R1Q fails to fin-d a 
single new solution and an existential Variable appears it G', then F 
I* abandoned and a special mark is plated on F to inform the program that 
F no longer is available for a spite, Howaver, if the attempt to solve 
G f without R9 and ElO fails to find a solution but no existential variable 
appears in G' s than G 1 would become a parent goal whereupon this special 
marh would be removed frem all formulas cf I{G), The rationale for re¬ 
moving these marks (and thereby providing new opportunities for Spiles) 
is chat the use of G' as a parent goal would provide new information that 
had not been available when these marks criginslly were inserted. 



There is still one loose end that needs to he tied. We have said 


that in attempting to solve a case ve first seek a Solution using all 
the deduction rules except R9 and RID hut impose a Raxtmuns lifflit on the 
time spent looking for such a solution. The reason for this maximum 
time limit is that we dc not wish to cemmit too much of our computational 
resources in this attempt when Additional c$e of rules R9 or RIG may 
he necessary. Therefore, if the time limit expires before a solution 
can be found and the program does not wish to abandon the goal, it would 

remove from the system those formulas ^ . F which have not 

yet been processed by the deduction rules and combine then into a single 
compound formula 'Ha = a) V (F^/NP^ . r , T ) r This compound formula 
then is placed at tha bottom of the list of disjunctions and negative 
conjunctions that are associated with the current goal where it would 
be applied to rule R.9 only after those formulas which precede it on the 
list* This allows the program to continue work on a goal by utilising 
a reasoning by cases argument without waiting to process all the for¬ 
mulas F,| , . 7 t n, If it should turn out that one of the formulas 

F^i F is necessary for the solution to the goal, then 

eventually the disjunction Ha * . . . , .^\F ) would get split 

by rule S3* In that event* ease 1 consisting of the hypothesis Ha m a) 
would be solved trivially and. lySF^ hl ♦ * */\P (the hypothesis of case 2} 
would be decomposed by rule R3 thereby reestablishing the formulas 




5 . 


Satie Facilities fog Re-pregent ing Pr ocedural informatiori 


When presented with a new axiom, a human often will hare some ideas 
about how the axiom is to he used, A program that can absorb these 
ideas has an ad van t ape when it c&mes to solving actual problems. The 
present profiran has three main channels through which such information 
can be received and utilised. 

First, as mentioned in Section 3, the procedure by which an axiom. 
A^h ia used in a problea is different from the procedure associated with 
the logically equivalent axiom '-A'v'te; the first formulation is used to 
help solve a case, whereas the second formulation is used to split a 
problem into separate cases. 

Second, a routine, associated with an input F to $ two input 
deduction rule, car. decide whether a particular formula should be 
paired with F as the second input to the Tule. Third, descriptive 
information, associated with one of the inputs, can be passed along to 
the output of a deduction rule; this descriptive information might be 
a factor later in deciding whether this output formula should be accepted 
as a second input to a particular two input rule. 

For example* before the formulas AT>B end A 1 are applied to rule R6, 
the program checks so SC* whether a special attribute appears with ATE, 
If this attribute is not present, then the program would continue its 
attempt to apply ATJh and A’ to tulft R6. However, if this attribute 
does appear with A T?E, it would have Some IFb-V routine E associated 
with it; the program then would execute the routine E using the formula 


A 1 as input data to the routine. Under these circumstances T the 
decision to continue (with this application of AUB and A’ to rule R6) 
would be made on the basis of the result obt-Ained frees this execution 
of routine R. Thus, e routine R associated with the formula xEGTjx ^CG 
might reject (a 1 ) 1 CG as the second input to rule R6 in order to 
avoid an endless application of rule PS to jiEGDi ^CG. Another way to 
handle this example scecs from the fact that any attribute appearing 
with formula B gets transmitted to the output formula &’ upon the 
successful application of A^B and A 1 to rule B6 . Thus, the routine 
R might reject A T if it determined that the creation of A" occurred as 
the Output from a previously successful application of rule R6 to x£GD 
x routine R could make such a determination merely by checking 

CO soft whether a special attribute that appears with y. ^eG appears also 
with A'. 

There is one attribute, known as the "expansion" attribute* which 
is processed by the program Ifl ft Special way* Thus, if the expansion 
attribute appears with a literal, then the program does not allow the 
literal to be W&ed as aft input Co any of the deduction rules except 
rule Rll and then only in the role of A(r f ). After Each previously 
generated formula of the type r * t has had a chance to he paired with 
this expansion literal for a possible application to- Rll, the expansion 
litoral is removed from the system. For example, the placement of the 
expansion attribute with X*z > y*s in the formula & S* y > y*r 

would causa this latter formula Co generate expanaiens. Thus* formulas 
b > c and x > yDs*J > y*i applied to rule R6 would generate the 



expansion b*t > c*z. Siin.il.irly t the application of ■ t) vith the can 

collation lav x*z. > y*z U k *» y to rule R7 would generate t(h*K - 0 * 2 .) 
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as an exp anaion. 

6 ■ The Rpoblem Solvin g Ex ecutive 

Any problem solving program must have soioe overall scheme for 
allocating its computational resources. This "global 1 " allocation 
already hes teen described for the present program in Sept ion 4 in 
connection with the implementation of rules R9 and RIO* However, the 
prosent program also must conduct M local"' Searches which are character¬ 
ized hv an attempt to find a contradiction without further use of rules 
R9 or RIO. These local searches must likewise have a procedure that 
controls and guides the computational effort; we describe now this 
local allocation. 

A non-literal is given priority ahead of a literal when it comes 
to deciding the next formula to be removed from + , and applied 
to the deduction rules. Amsng non-litersls. the order is first come 
first served. Among litorals f priority is determined cn the basis of 
a lexicographic ordering which. eh&Oses the literal which £1} depends 
upon the fewest number of the K_^ (i.e., SO that a solution obtained 
from this Literal would have a hutict chance of not necessitating the 
solution of too many additional cases) , and in ths event of a tic 

^^However, we would not generate the expansion if either b or C 
were of the form u*v where 0 and u were existential variables since we 
wish tp avoid the effort of trying to create a split by solving fer 
one existential variable in terms of she other. 



attempt b to choose a literal which does not possess the asp ms ion 
attribote, and in case of still another tie chooses the literal which 
(3) has the least c-oraple-Kity where complexity is measured by the storage 
space taken up hy the literal. 

However k in trying to find a direct solution to a cast involving 
existential variables, we do not process any expansions if a solution 
already has been obtained. The reason for this is that the processing 
of expansions is too expensive to justify their use when looking for 
additional splits since we can do sc at a later date if the current 
split should prove insufficient. 

7. The Equality Relation 

It should be noted that unlike the treatnent of equality by resold" 
tion based theorem provers [l3k rule Ell does not permit the replace¬ 
ment of a tern in a fonrrula unless that formula is a literal nor de&£ 
it allow this replaceaept to he made on the basis of an equality r = t 
unless the truth off r * t already has he*n established. The reason 
we can do this is that the reasoning by csbcs mechanism serves to detach 
the individual literals from a formula as it analyses the separate 
Cases SO that eventually these literals Will be available for use by 
rule kll, however,, the advantage in postponing these replacement* is 
that it keeps apart those formulas used in solving one case from for¬ 
mulas used in Solving a later case with the result that formulas .Stemming 
from different cases do not interact with each other to produce additional 
formulas. Also, since thfc program does not always have to consider 



all caeca arising from a Split (as was discussed in Section this 
postponement enables the program to avoid the necessity of generating 
furtulas fre» a later case if the current case should prove to he 
Irrelevant to the solution of a higher level goal. 

In rule ftll we impose the requirement that neither r nor r 1 tan 

be variables; for if either t or r" were a variable-* then the match of 
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r with r 1 always would he trivially satisfied. The significance of 
this restriction is that It limits the application of rule Ell to 
situations where its success fill application would provide ns With Some 
’’information 1 ’ in the sense of I 16] (i+t,, the success of a rule givCe 
us no ’’information" if its Success is a foregone conclusion) , The 
practice! usefulness of this restriction is that it greatly reduces 
the number of formulas generated by rule Ell with little risk that 
one of the discarded formulas will be necessary fox the solution. 

In applying the equality b =■ c as an input to rule Ell f the 
prOS TSCl will identify b with r and c with t (i.e. a it will replace b 
by c TBthcT than replace c by b) on the basis of the following eight 
conditions, where condition i takes priority over condition j for i < j 
(!) c appears as part of term b, 

(2) store variables appear in b than in c. 

(3) t is * constant which appears in a special list provided to the 
program by the user (so far, this list has consisted only of identity 
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This assumes of course that r does not appear as a evbeiewnc cf 
r 1 and vice versa► ' 



s vrflboLij such as 0 and 1) , 

(4) a special attribute appears With the -equality b ■ c which tells 
the program that the right side the equality should be substituted 
for the left side {so far* this attribute has btutt used just once 
and that was to denote that b had Sighifitauce only in its capacity as 
the definition of c), 

{5} b* hut not c, represente an associative product {i.e., b is of 
the form r*t where * obeys the associative law; see Section £ for 
discussion of associativity), 

(6) neither b nor c represent an associative product , but b is a 
function of more arguments than c, 

(?) both b and c re ore sent associative products such as a, .*a 

11 n 

where a, the number of terms in the product, is greater for b than it 
is for t, and 

(0-) b is of greater ’’complexity" than c where h as in Section the 
complexity of an expression is measured by the storage space ic 
oc cop Los* ■ ■ ‘ ;i - ■ ■ 

If neither b nor c C*n be identified with r on the basis of the 
above eight conditions, then an arbitrary choice is made. If a decision 
is made to identify b with r but an attribute appearing with f omul a 
b ^ c Indicates a desire that both sides of the equality be given this 
opportunity* then c also would bt Batched with r aiid if successful 
the output A(t’) would be designated as an expansion. 




If in rule Ull (1) one of the abfl’uC eight conditions does prevail 
for r ** t> (2) A(r') is not an expansion * and O) the matching of t to 
r T does not reduce the generality of r‘* then the generation of A(t') 
by rule Hll allots us with reasonable confidence to eliminate A(r’) 
frofl further consideration provided that the new formula A(t 1 ) does not 
depend upon any of the m not already depended upon by Aft 1 ), The 
program takes advantage of this by employing rule Ell as thE first 
deduction rule to be applied to a literal! the literal is given the 
role of A(r r ) f R rule Ell and different equalities r = t are paired 
with it In the hope that one of these equalities will lead to a quick 
elimination of the literal* Indeed T if the application of one of these 
equalities tc rule Rll generated the literal L 1 from the literal L 
without eliminating L hut a subsequent application of a different 
equality to rule Rll did eliminate L, then the program would eliminate 
the literal L’ as will. 

tot have already restricted the application of rule Ell by not 
allowing the specification of an existential variable (i.e., we required 
in Section 4 that an existential variable could be specified only if 
the, specification occurred during an application of rule Eli■ We now 
place a further restriction on rule Rll by not allowing any variable from A fr 1 ) 
to be specified I. [l.e., the match of r to r T in rule Ell is not 
allowed if it. reduces the generality of r r ) unless Atr 1 ) is either an 
equality b =■ c s or its negation *Vfb m c) h or an expansion. In order to 
compensate fot these restrictions on the treatment of literals which 



do not involve the equality predicate „ we included R12 as a rule of 
Inference as this rule takes two such Literal-? ay input and products 
the negation of an equality as output . However, since out of the 
motivations for rule RLE was its compensation for the restriction of 
substitutions into variables, we will require that at least one of the 
two input literals to rule R12 must possess a variable, 

S, Associativity and Cozciiutativit y 

Functions which are either associative and commutative or Just 
associative play a fundamental role in mathematical reasoning. For 
example, addition and pult ip 1 i cn t i On in ordinary arithmetic each 
satisfy both the associative .and commutative lews. (hi the other hand, 
the multiplication of operator? (such as found in matrix multiplication) 
provide important examples of functions which satisfy the associative 
but not the commutative law. In view of the great importance cf 
associativity End commutativity * special routines were built into the 
program in order to provide a more accurate simulation of human problem 
solving as well as to exploit better the power which is available 
whenever it is known that a particular function satisfies tithor both 
the associative and commitatdue laws or jus t the associative law. 

An associative function f is one which depends Ofl two arguJa&fltS 
and satisfies the relationship f(x, £(y,c>) * £{f <!t,y) f«r ail * > y > 
and z ■ It follows from this that the expressions f{a,f(b,f(c,d)))„ 
f(f(a T b),f(o.d)) , and f(f(f(a,b),c),d) ate all equivalent if f is 
associative*- Using the more familiar product symbol * in place ef f. 



the. above expression. cat* he written us aMh*fc*d)), (**h) * (c* A) , and- 

((a*h)*c)*d respectively. Clearly* the key feature of an associative 

product is that it is independent of the way the parentheses are 

grouped, A hmnan who y*fc? itt associative pxoduc t acknowledges thi*. 

fact by Writing the above expresslohs as a*t>*c*d; at one stroke he 

the re fc re saves processing time; as well as memory by avoiding the 

necessity of treating these equivalent expressions as distinct entities. 

Although an associative product fs defined formally as a function of 

two arguments, it is used by humans informally as if it wore a function 

of an indeterminate number of arguments s„ s* *.* s where m can 
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be anv integer greater than 1. The same point of view is adopted by 

the program which* foT an associative function f t strips the parentheses 

from different expressions involving f by reducing them to the canonical form 

f(g^* a n , s^) ■ Thus, the program immediately would reduce 

f(f<S 4 , fU 5 , s 3 J), f(f(s lt s 2 >* s fr )> to f(e 4 , s^, s 3 , s,, s 2 * s & > 

if it knew f to be associative. Throughout the remainder of this section 

lc will be assumed that the function f ig aggoefiitive, 

Standard match routines* such as described in Section. 2, can reduce 
and eventually eliminate the differences between two expressions A and 
A' only if A and A" have the same structure (i.e,, only if in those 
places where the two expressions A and A r differ, a variable appears in 
one expression which can be equated to the terc appearing in the 
corresponding part of the other expression) . However, for dealing with 
associativity {and especially for commutativity) a more generalised 
method of matching is useful (such as the pattern matching of [&]) 




which tan rearrange the position of terms within a structure as well 

as dat arming values for variables located at fixed positions * 

The present program utilises a routine MATCHA which tan bring 

into correspondence two associative functions f £s, , + * + + +, S } and 

1 ill 

f£c^,-- +» t n > even though at and n may not be equal to each other. 

The execution of . MATCHA(f {a, „ , , , , . , s ) , ft >> operates 

-L re 1 n 

as follows. For purposes of exposJtiern, we extend the definition of 

f to allow it to depend on only a single argument by defining 

fC-s.^ = a . With no loss of generality, ue assume m < n. If in - 1, 

an atteapt is made to bring into correspond once with , 

perhaps by a substitution of certain terms for variables * after which 

an exit is cade from routine MATCHA, Suppose m > 1. We first attempt 

to find a substitution which will make a. identical to t^ and if 

successful, we then execute MATCttA(f Cs~». * + + TT s >, f (t* .. t ]0 

- m i. n 

far this substitution* After . MATCElA(f (s,,,*, ***, s >, f(t,... t }) 

*■ in £ n 

has been executed, we undo any substitution of a term for a variable 
that Sight have been needed to make s^ identical to t^„ At this point, 
Iftt-aorSj is not a variable, we exit from the routine MAT C HA. Other¬ 
wise, beginning with r ■ 2-, we set the variable s^ equal to the teit 
f(tj S ,*, t ) (provided of course that s- does net appear in f(t, , 

and. then execute MATCHA{f<a - & ), f(t . . t )) fcT thia 

e r r + 1 n 

substitution after this has been tried for all integers t Such 
that r > 1 and m - 1 < n - (r+J) , an. exit Is made from the routine 
MATCHA. For example, if x and y are variables, then the execution of 







MATCHAff(x s y) , f {fi , b, c)) would produce two successful matches corraspending 
to K “ 3, y ■ f(b p c) and x = f(a,b) t y = «, $ imiEarly, the execution 

of MATCHA(f(x,a,x h a p b),f{b,c,a,b,c,y)) would produce only one successful 
match (i.c. p for x - f {b h c> „ y = f(a,b)). 

The rep I a cement mechanism in rule Fill Is designed to take advan¬ 
tage of knowledge that a function f is associative. Thus, suppose t and 

r 1 in rule HI 1 are terms of the form f{ t^ .. rj and f[rj. r^) 

respectively where m < n, For each Integer j such that 0 < J < n ~ m, 
the program would attempt to find a substitution which would make r. 

■denticai to r', + . for all 1 “ 1,2. m. If the prog rats is successful 

for some J, then the output of rule Hi I for this j 'would be of the form 

Aft 1 ) if m = n, A(f {t H P r f J )+1 t -r^) ) if 0 = J < n - m, 

A[f [r] ,*♦ + + ,rl ,t r . >•- r* )) if 0 < j < - b, and 

Aff frj j.. . ». Tj , t 1 } ) if Q C J *• n - m, For example, f{x,x) ~ t would cause 
A[f(a,b t b ± c)) to be replaced by A{f{a t e ± t)) where m = 2, n - 4 and J - 1. 

Art additional replacement routine is available for use when n< n 

and sone r, as a universal variable. Thus T for each Fntegcf j such that 
D < j < n - nip the urogram w>u I d execute a routTne 

MATCHB(f{r lt .,.-r^) + f(rj + ,The routine flATCHB is the same as 
routine HA CHA except that (1) it does not allow a substitution which 
reduces the generality of ffrj +] , , t ,.r'J and (2) It proceeds 
differently when it reaches the point where, For some p > j- + m - 1; 

it must execute KATCHB(f(rj h F(r^j th * *,r’)). Instead of execut¬ 
ing MATCMSff fr^) ,f (rJ +] ,. r^ ) by trying to bring r„ | into corre 5 - 



pondence «ith f (r 1 (as MATCE1A would have doneJ it tries 

P T 1 U 

to bring Into correspondent* with Fft^ + ^.r') for seme integer 

q such that p < q < n. If the program is successful for seme j and q T 

then the output of rule RII would he Altit*)) If 0 = j < q = n * 

A(f ft r h r' , , -. i , r') ) If 0 ■» j < q < n, Aff (r, r t ,, > ,r" w t*, t' , ,,,,,, 

q “ ± ii X j q -r ± 

r^>> if 0 < j < q < n s and Aff(r’ „ *.. + r^,t 1 )) if 0 < j < c = n. For 

example* £(x,a) = e would cause A(f(a,b*e,b,c,a>) to be replaced by 

A(f fa i .B > a)>ii where m = 2, n = 6, J ** 1* p » 3 and q ■ 5, 

For Che remainder of this section it will be assumed that the 


function f also satisfies the eorauitatlve law (l.e., f(x,y) = f(y*x) 

for all x and y)„ The program U3« the routine HATCHC to bring into 

corresocndence two functions f (s. }. and £(t, t ) when it is 

I ...m .1 _ n 

known that f is commutative as well as associative. The execution 

of MATCHG ff (b,h * * >e )* f(t.*+*+t )) operates as follows. Kith 

no loss of generality, we assume m < n, If m = I* an attempt is made 

to bring s 1 into correspondence with f(t.+■*«t ) after which an exit 

-L in 

is made from routine MATCHC. Suppose tt ^ 1+ fc?e first attempt to 


find an s_ (giving priority to these i. which are not variables! 

far which a substitution can be found that makes s. identical to t 

for some t , If we are unsuccessful* we exit from the tontine MATCHC, 

However, if we are successful for some s, and t., we execute MATCHC 

1 J 

(f(s lJ .,., tBj _ v 9 ± + i .V* f(t l . C j - 1- C j + 1. V } 

for this substitution and then exit from the routine MATCHC. 


Tho Execution of rule Ell, when it ip known that f 1$ cc“iitative 


as well as associative* is governed bp the routine REPLACE- Thus, 



suppose r and r' in rule Rll are terms of the fa no £(r^±*.*■*r^) and 
£(rj t .±r^> respectively where m < n. The execution of REPLACE 
(f(r^*■*■*F f m )» f(r^ 1++++ tt^)> Operates as follows, Beginning with 
j ^ 1, the program attempts to find a substitution which would make Tj 
identical to r’■ If the program is successful and ra > l f it would 
execute JtEELACE{ f(r*,*+++,r ) , f (rl , . 




■1 .‘m' 1 “‘1. r J - 1‘ r J + I’ 1 

for this substitution* If the program is successful end m “ 1, than 

the output of rule Rll for this j would be of the form A{f(t r , E t f „ „ „ „ t 

r’ r! + ^,r^)) - The program carries out this procedure for 

each integer j such that 1 < j < n (unless a j is found which produces 

an output for rule Rll that allows A(r') to be eliminated}. For^r 

eKample* * e would cause A(f (h, a*b ,t>> to he replaced by 

A(f(e ± 4 T c>>, 


9 * CoTapu.tatiur.aI Ex p erience 

The theorem proving program described in this paper waE written 
in IFL-V and run on IEM 360/50 and 370/145 computers. The maximum 
partition available to the program was £50,000 bytes cf core storage* 

This amounted to 22,000 IFV-V words {after loading the ZPL-V interpreter) 
of which 7,000 were consumed in loading the program leaving 15 ,000 
IFL-V words for actual wotV- space■ Although the available memory was 
not large by current standards , computing tire was mere of a limiting 
factor than memory, for IFl^V is an interpretive language and therefore 
vetry slow in execution* The ptogrsm might well have run an order of 
magnitude faster if it had been written in a language that was capable 
of execution in a compiled form. The computing times of the examples 



reported in this section are fop the 37Q/'145, 

Among the theorem proved by the program were all nine problems 
from group theory and number theory vhich were reported in [4j. It is 
also capable of solving much more difficult problems than these as 
evidenced by the examples to he described in this section* It accom¬ 
plished this without the uep of any bounds on substitutions * 

The following interpretations are used with the examples of this 
section 2 

(I) K & y means "s Is a member of set y 11 * 

(23 *C 7 means ”x is e subset of y” f 

(33 x = y means ’’set m is identical to set y 1 ", 

(4) i Uy means ” the union of sets k and y", 

(5) x ri y means ’’the in ter section of sets s and 

(63 x - y means ’’the set obtained by taking set 5t and removing from 

if all elements that appear in set y", 

(7) U weans " the universal set Consisting of all elements 1 '* 

(6) SB(x) means “the set of all Subsets of x M p 

(93 * means " the associative product for the group 1 '* 

(10) e ^eans " the identity element for the group' 1 * 

(II) s(x) means "y is a subgroup of the group" 

(12) p(S*T) menas 'the product set KT consisting of all elements x*y 
such that x £ It and y £ , 

{13) prime fx) means "y is a prime ¥iumhfcr'\ 

(14) x|y means r 'x divides y" p 



(15) ratlore1 (?0 means ’’a is a rational number", and 

(16) gqrtfit) means M tbe scuarc root of K’ l1 + 

Example _l r The set t>f all Euhscts of A intersected with the Sot of 
all subsets of B is identical to the sel of all subsets of A interBeetion 

Al► ^(f(s,y) ea 3 f (x,y) sy) a ffU^y) ty 3 f(x,y) 3 x = y 

Al. h S y 3 3 Kry!) A (zF.y^) zek) 

A3. (zEX Ae£ y) 3 e£x n y 

A4* i?sn y 3 (e£x s\ Ety) 

AS.. (jc * ae ^y)D 2 C 
A6 . ?CkAv d (zcnAicy) 

A?. {zc,k^ 25 U)D zt - se, C k J 

AS. z£$E(x) O (ZCSA\2.£U) 

A 9 . a,(S*<A) n &B(B> s SB (A nB)) 

Hit program proved eisn^ile I in one minute from the above set of 
initial axioms and generated 36 new formulas la the process. Although 
this theorem had. been pToved in [£],. that program utilised routines 
char were especially designed for set theory, however* this has not been 
an easy theorem for general purpose theorem provers. VTiat is perhapE 
remarkable about the effort of the present program is that it did 
not generate a single formula that was not necessary for the proof 
(l.e., each of the 36 additional formulas generated belonged to the 36- 
Step proof produced by the program). 



I The square root of every prime number is irrational. 



Al. n*(y*z) " 

A£. x*y = y*x 

A3, k = yUK** " y*2 

AG, x*E = y*2 ^ X “ y 

A5. e*2i = s 

A6, y*3i = x D y = & 

A7. sqrt [x)*sqrt (h) = *- - 
AB. y/y+x 

A9. y/x^K = yAh(ij,y> 

A10 . prime (*:} ■ ID ^(1/7*1) vx/y v a/s] 

All. r “ prime(■]' 

A12, rational(x) 3 {f(*) - (y))v^fy^(x5) v 7 “* E }} 

Al3. * [prime (a] 3 "■'rational (sort <a>) ) 

The program proved esunple 1 in nin^tes and generated 1-5 3€i-naw 
formulas in the process. Al though example 2 has been the object of 
considerable attention in the literature 114 ] s [5] s [lilt the present 
program is th.fr first to pTove eViib theorem without the aid of special 
hints that reflected ft previous knowledge of the proof. The program 
produced a 3-0 step proof which is r&prfrduefrd bel&w, Since the basic 
operations for equality, SUfrh as its reflexivity x = i, atfr implicit in 
the operation of the program, they are not mentioned directly in the 
proof 1 Also, no direct mention is made of Al and A2 since the effect 
of these axioms is implicit in the choice of match and replace routines 



used by the program as described in Section 8, 

Frucf of i^ainple 2 : 

A14, prijnefaJ Rule P.5 applied to A13. 

Alt, ^ A, rational (sqrt (a)) Kelt R5 aopiied to M2„ 

A16, rational [sq^E(a)] Rul* E2 applied to A15. 

M7, Ma/ynr) Va/yva/a Rule RS applied to Alfl anti Ali* 

A18, [f [sqrt (a)] = sqrt (a)*g (sqrt {a)} j ['u(y/f [aqrt <a>) J 

'V^fy/a (aqrt(a) ] J vy ■ ej Rule R$ applied to A12 and A16 . 

A19. f (sqrt (a)j = sqrt(a)*& [sqrt(a>) Rule K3 applied to A1&. 

A20, n.(y/f i'gqrt(a))J\Z^(y/g(*qt-c{o})) vy « o Rule R3 applied to 
A13. 

A2l. sqrt (x)*sqrt 00* = - x*s Rule RG applied to A3 and A7. 

A22. aqrt(a)frg(aqrt(e>)*z - f[sqrt(a))*z Rule R6 applied to A3 
and Al9. 

A23. sqrt(a)*f[sqrt(aj] * **£(sqrt( 4 )] Substitution of A19 into A21 
ns;Tif Rll. 

A24. **s(sqrtU)}* g (fiqrt(a)) = f(sqTt<*))*ffsqrt(a)] Substitution of 
A23 into A22 using Rll, 

A2 5 , Ma/y*z) Use 1 &£ AIT, 

AS6, Mate - y*z) where j 1 wd j are the same variables that appeared 

ih A25 ( i. a , „ y and z have bt-en given an existential interpretation). 
Rule Rl2 applied to A6 and A2j. A contradiction is cbtaired between 
A24 and A26 for y = z - f{sqrt(a>) and x - ft (sqrt fa.)) *g [sort fa) j . 

This causes AS 5 and A26 to be replaced by A27. 



A27. a/f I-Sflrc (a} i ; sqrt (r} i Cases 2 and 3 of A17 for y = z = 

f l g q r t {j - 

A2B, ^(y/f (sqrt<a>)) Ca^e I Of. A20. 

A29. A^yjtn " f : sqrt{a)] I where y ie the seme variable that appeared 
in A2&. Rule R12 applied to AS- and A?S, A contradict ion is 
obtained between AI9 and A29 for y = sort (a) and m = g[sqrt(a)] , 
This causes A2B and AZ? to he replaced by A3f>, 

A3Ck n^[sqrt (a)/j feqxt (a) )j V (sqrtfa) = c) CaEe 2 and 3 hi A20 
for y ■= sqrt(a). 

A31+ a/ffsqrt(a)] Case 1 of A2 7. 

A32 + f[sqrt(fl)l - a*ti(£ fsqrt'fh)) >a) Rule Rfi applied! to A9 and All. 

j\33 + a*h(f (sqrt (a}) ,a] *Z = f(sqrt(&))*z Rule A6 applied to A3 and 
A32. 

AM. a*b[f.sqrt(a)),aj*gqrtCa) * a*g (sqrt (d)] Subetitution of A23 
into A33 using rule Ell, 

A35 * h [f [sqrt (a) I ,a] *sqrt (a) 3 gfsqrttaj]- Rult R6 applied to A4 
and AM, 

A36. ^(sqrt(a)/g(sqrt(a)]) Case 1 of A3 Ah 

A3?. M^scTC(a) sw ■ gfsqrt(a)l) Rule R12 applied to AS and A36, A 

contradiction is obtained between A35 and A3?. This caus&S A36 
and A37 to ba replaced by A30. 

A3B. sqrt(a) = a Case 2 of A30. 

A3 9* e*aqrt(a} “ a Substitution of A3fi into A? uslofc rule Rll* 
aqrt(a) 3 a Substitution of A3 into A39 using. Rule Rll. 


A40. 



AAI. c » a Substitution of A3E info AGO using pule J?H. 

A42. prime( £ ). Substitution of A41 into Al4 using rule Ell. 

A conEradicCion is obtained between All and AA2. This causes A31 
through A42 Co be replaced by kU 1, 

A43. a/f (sqrt (a)) Ca.se 2 of A27. The program. solved this last 

rase tie rely by noting that it is the same as a Case which bad been, 
solved previously (i.e.* it is the same ns A31}■ The proof of 
the the opera now is complect since all outs tending cases have been 
solved, 

Ej^ii-ple 3 .: Gt-£u r s three axioms are sufficient Co define a ternary 

boo l eon algebra, 

The following five axioms define a ternary boolean algebra, 

Al + f(f<x,y F u) T v, f(i p y T w)] - f[ic,y.f 
A2 , f {y F jt ,x) * st 
A3- f(*.Tp£(7)]" * 

El. £{x f x p y) = st 

£2. f (gCy). y.x) = x 

The object is to Shot? that axioms £1 and B2 both follow from axioms A1 
through A3. That this . in fact.could be..dsne was announced in the 
mathematical Literature [6] but no proof was presented. It was proved 
subsequently by on interactive thaor&m proving program [ll which 
Utilised a hufcao participant in the proof finding process. The following 
IE step- proof is quite different from the one in [1] and did not 
involve aPy human participation. The program is present ad with axioms 



Al through A3 as well as the denial of B2 , It was not necessary to 

deny El since El was produced in the course of mrovin^ 62 H The proof 

toot. 110 minutes during which 2^5 nav formulas were created, 

A4. 'Ll f £g {a} , a ,b] “ hj Denial of B2. 

A5. f [y,T,f (ij.w)) - f [x„y ,f Cy,v.e)) Substitution of A2 into 
left side of Al using rule Rll, 

W- f(yiV,y) * f (s ,y,f (y*Tf t y)) Substititiofl of A2 into left side of 
AS using rule Rll. 

A7 t f (y ,V T f <X T y ,v)] - f (jc,t,v) Substitution of A2 into right side of 
A 5 using rule Rll. 

AS. f(f(v ± Wju} t u + fO^w)) - ffu*VtV) Substitution of A7 into right 
side of Al using rule Rll, 

AS, f[fCvjW,u)* v T u) - ftu.VjW) Substitution of A2 into AS using 
rule Rll, 

Alt), fCxj3c,y) = ffg{y> s K s y) Substitution of A3 into left side of A9 
using rule Rll, 

All. x - f fg(g(x>) .Jijgfi)] Substitution of A3 into left side of AID 
using rule Rll. 

Ai3. a(s(*))■ x Substitution of A3 into All using rule Rll. 

A13■ f£*»(Cy).y) - K Substitution of All into A3 using rule Rll. 

A14, f(v h g{sj f fG^y**)} * f <X,jr,y) Substitution of AL3 Into right 
side ox A5 using rule R11+ 

A15. ffy p g{£> p f(x,y,K^) - j Substitution of A2 into Al4 using rule Rll. 

A16, f (y^s(afy)) *x) - y Sub Stltution of A3 into AlS using rule Rll. 

AL7. f{x p x p y) * x Subscleution of A12 Into Al€ using rule Rll. TTiis 



proves axion; SI, 


Al£- ffn^y.iO = k Substitution ojf At into A17 using rule Rll, 

AJ-9* f[g[y),5i,y) = k Substitution of A17 into A1Q using rale Rll. 

A2Q. ffy^ t g{y>) » K Substitution of A12 into A19 using rule Rll. 

A21. f(j; 3 y t *) - f[g(y) h y ± K ) Subs ti tut ion of A2Q into left side of Afl 

using rule Rll. 

A22. f (afy) >7**] - x Substitution, Of AlS into A21* this completes the 
pfoof since A22 contradicts AA t 

^ajnjlc 4 : In a group„ if xnx*-* = e and f(:t h y) - 3 «v fcx ~ 1 *y ^ 1 for til 

K and y, then F|f(a 1 b) t b) » e 

Al* n+{y*E) = fx*y)*z 

A2+ n = yr7x*s “ y*2 

A3* r = y^s*n » z*y 

A^. x*2«y*23jK=y 

AS. Eft = z*y ^ x * y 

Ab . s*e = x 

A7 . £ *3 = x 

Afi. 3 *3 ^ = e 

A? * k ^ *3 = e 

AlO, m e 

All. f(Xjy) - K*y*3 ~*y ^ 

A12 - b) 

Example A was discussed in the appendin to [I3] in which it vaa 
shown that a paramoculatior: proof of this thooT-em could be found which 



took cnly 47 stepe whereas a comparable proof uaing ordinary resolution 
took 13S Steps, Hpwevtr,, these proofs -were not obtained from an actual 
procedure for finding proofs but were the product of an inspired human 
effort, &y Contrast, the first computer produced proof of this theotets 
vss obtained by the present program. This proof was 44 steps Long, and 
toofc 30 minutes during which 4L5 new formulas were eroa ted., 

Example o ' Let X. be a Subgroup of group G und Kog be the right cosbt 
of E in G fer some g&G. Then ¥ug is identical to K if and only if 
g ia a aieraber of T£. 

Al. Hftfyfrs) = (x*y)*E 
A2. r ■ y 3 x*£ ^ y*E 
Al* x n y 3 sax = 

A4* x*s ■- y*£ Z> * = y 

A5 * z*x - s ay 2 s = y 

A 6 . 37 * e = x 

A? , r.ftT = x 

A0 k ^ = e 

A9, k = o 

AID. fx" = x 

AL1, (jFtx,y),ex 3 f(i,y) Ey] 3 xcy 
A12* xqv 3 (sex 3 icy) 

A13*<zcj: v *Ey) 3 itxgy 
A14. rexUy^) {saxs/s^y) 

A15 + {sex^xzey) 3 zexny 



A16. =) (spK A zry) 


A17, A Me^)) D z£x - y 

Aia. iE K - y 3- (eex /s r “(z cy)) 

Ai9. Csc yAjfc«) J m = y 
A20. IX = yl^KC yA ycx) 

A2l- / ( [g(n>FK /A h<x) Cx) 3 KWahU) e*j 
\ CfiCxJ ex Z> (gOO)' 1 os] 

A22* # (zt) u feE£^\ (xei p x. "^£e] ^ (x x££)] 

A23, s(e> Z> {x*y£E V Mx£z) \y^(y£z)) 

A24, s(K) 

A25. JtFK.DE 3 fx = !L(x t z}#S A c(x, e) eE) 

A26„ x*s FKo z v" 'V(x£E> 

Ai7. ^((Kcg = It 3 B5£> ^ tog H K)) 

The program found! a 46 step proof to example 5 in B minutes during 

13 

which. 2l& new formulas were generated ' , 

faampla 6; If K and K ate Subgroups of group G, then the product set 
fl& is a subgroup of G if and only if HK is identical to EH. 

A1-A22. Same as in Example 5. 

A23. s(x> ^ [a(y}25 (eep(x,y) D (3 - m.(x ,y „ s)*n (x»y T z } A ,y p s)ex 

n(xiFiz>Ety>>) 




Axioms A2 through AlC were not all awed to interact directly with 
each other since the remaining initial axioms were placed in a sat of 
support 1201. The purpose was only to save a little computer time ejs 
these nine axioms were very familiar and their initial effects quite 
predictable t 



A24. s<h) D [a(.y) J (u*v &pfx,y) v*'‘'( ueiO vMtejO)) 

A25. sfa) 3 (t(y) ^ (e*w cp<x ,y) v' L < 5 = ««) v%(w = r*vO 
^(u£x) -s/MvCy) V' Mr*t £p(x p 7)))} 

A26, S(H> 

j\27„ ${K> 

A28.^( (pfH.lt) = p(K*H) 3 e fp ( H P K) ) )/\ (afpfH.K)) 3 p(ti.K) " p(K,H) 

The program found a 134 step proof to Exaiiple 6 in 72 minutes during 

14 

which 960 new formulae were created* 

To this author’s knowledge, e^apspt^^ 5 and 6 have never appeared 
before in the literature on automatic theorem proving- Each of these 
examples is noteworthy itl that the computer was confronted with a 
very rich set of initial axioms. Both of these examples Card especially 
Eraaple 6) should prove to be quite a challenge to machine oriented 
automatic, theorem Drovers. 
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